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Abstract — We characterize the capacity of the general class of 
noncoherent underspread wide-sense stationary uncorrelated scat- 
tering (WSSUS) time-frequency-selective Rayleigh fading chan- 
nels, under peak constraints in time and frequency and in time 
only. Capacity upper and lower bounds are found which are ex- 
plicit in the channel's scattering function and allow to identify 
the capacity-maximizing bandwidth for a given scattering function 
and a given peak-to-average power ratio. 

I. Introduction 

A well-known result in information theory states that, in 
the infinite-bandwidth limit, the capacity of a time-frequency 
(TP) selective, wide-sense stationary uncorrelated scattering 
(WSSUS) [1] fading channeQequals the capacity of an additive 
white Gaussian noise (AWGN) channel with the same receive 
signal-to-noise ratio (SNR); capacity is achieved by codebooks 
that are "peaky" in both time and frequency [2], [3]. It is also 
well-known that the AWGN channel capacity cannot be achieved, 
in the infinite-bandwidth limit, if a peak constraint is imposed [4], 
[5]. In particular, as detailed below, different forms of peak 
constraints lead to different infinite-bandwidth capacity behavior. 

Peak constraint in time: A closed-form expression for the 
capacity of a WSSUS TF-selective fading channel under a 
peak constraint in time is not available in the literature. The 
rate achievable by frequency-shift-keying (FSK), in the infinite- 
bandwidth limit, has been shown by Viterbi [6] to be given by 

v ' 

C'aWGN,oo 

(1) 

where P is the received power, A^o stands for the one-sided noise 
spectral density and S{i') is the power-Doppler profile [1] of 
the TF-selective fading channel process. The infinite-bandwidth 
rate achieved by FSK constitutes a lower bound on the infinite- 
bandwidth capacity of the channel and equals the infinite- 
bandwidth capacity Cawgn,oo of an AWGN channel (with the 
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' Throughout the paper, whenever we speak of fading channels, we shall refer 
to (without explicitly stating it) the "noncoherent" case where neither transmitter 
nor receiver have access to channel state information (CSI) but both are aware 
of the channel law. 



same receive SNR) minus a penalty term resulting from the peak 
constraint. In the absence of a peak constraint, the second term 
on the right-hand-side (RHS) of ([T]) can be made as small as 
desired by transmitting the FSK codewords with arbitrarily low 
duty cycle, and AWGN channel capacity can be achieved in the 
infinite-bandwidth limit [2, Sec. 8.6]. 

Expressions similar to have been found in [7] for the 
capacity per unit energy and in [8], [9] for the infinite-bandwidth 
capacity of time-selective, frequency-flat Rayleigh fading chan- 
nels under a peak constraint in time. 

Peak constraint in time and frequency: From the results in [4], 
[5] it follows that, under a peak constraint in time and frequency, 
the capacity of a TF-selective WSSUS fading channel goes to 
zero as the bandwidth approaches infinity. An upper bound on the 
rate achievable by transmitting constant modulus symbols over a 
TF-selective WSSUS underspread [10] fading channel has been 
obtained in [11]. This upper bound is explicit in the channel's 
scattering function and hints at the existence of a scattering- 
function-dependent, capacity-maximizing bandwidth. 

Contributions: We consider the general class of under- 
spread [10] continuous-time WSSUS TF-selective Rayleigh 
fading channels under peak constraints (i) in time and (ii) in 
time and frequency]^ Our specific contributions are summarized 
as follows: 

• Peak constraint in time: We recover Viterbi's infinite- 
bandwidth lower bound ([T]) using an alternative proof. 

• Peak constraint in time and frequency: We generalize 
the results in [11], which dealt exclusively with constant 
modulus inputs, by providing upper and lower bounds on 
capacity. Both bounds are explicit in the channel's scattering 
function and reveal the existence of an optimum, capacity- 
maximizing bandwidth, which depends on the scattering 
function and the peak-to-average power ratio. 

Proof techniques: Our entire analysis is built on the fact that 
underspread channels have a well-structured set of TF-localized 
approximate eigenfunctions [10]. The main proof techniques 
used in the paper are based on the relation between mutual infor- 
mation and minimum mean-square error (MMSE) discovered 
in [12], on a generalization of Szego's theorem on the asymptotic 
eigenvalue distribution of Toeplitz matrices [13] to the case of 

-The peak constraints, made precise in Section [n] will be imposed in the 
channel's eigenspace rather than on the continuous-tune transmit signal. 



block-Toeplitz matrices with Toeplitz block^[14], and on a 
property of the information divergence of orthogonal signaling 
schemes first presented in a different context in [15]. 

Notation: Uppercase boldface letters denote matrices and 
lowercase boldface letters designate vectors. The superscripts ^ , 
^ and * stand for transpose, conjugate transpose and element- 
wise conjugation, respectively. det(X) denotes the determinant 
of the matrix X, I stands for the identity matrix of appropriate 
size, and diag{x} denotes a diagonal square matrix having 
the elements of the vector x on its main diagonal. E{} is the 
expectation operator. S{t) stands for the Dirac delta function, 
and 6ij — 1 if i ~ j and else. All logarithms are to the base 
e. If a random variable (RV) x has distribution Vx, we write 
X ^ Vx- Finally, CN{Q,a'^) denotes a circularly symmetric 
complex Gaussian RV with variance cr^ . 

II. Channel and System Model 

We consider an ergodic WSSUS TF-selective Rayleigh fading 
channel H with input-output relation 

r(i) = (Ha;)(i) = j h{t,t ~ t) x{t - T)dT (2) 

where the impulse response h{t, t') is a two-dimensional zero- 
mean complex Gaussian random process. The time-varying 
transfer function of the channel is defined as [1] 

The channel's scattering function Ch(t, v) [3] is given by 

Ch(t, jj^ Mt, f) e-^^^'^'-^f^tdf 

where i?H(i, /) = E{LH(t + <',/ + /') ^K*', /')} denotes 
the channel's TF-correlation function. We invoke the common 
assumption [10] of a scattering function that is compactly sup- 
ported within the rectangle [—To, To] X [— z/g, z/q], i.e., Ci[(t, v) = 
for (r, v) ^ [—To, tq] x [— t'o, vq]- Defining the spread of the 
channel as the area of this rectangle, Ae = '^tqVq, the channel is 
said to be underspread if Ag < 1 and overspread if Ag > 1 [10]. 
The underspread assumption is relevant as most mobile radio 
channels are highly underspread. 

The key idea turning (|2| into a discrete problem is to recognize 
that underspread channels are approximately diagonalized by 
orthonormal Weyl-Heisenberg bases [10], which are obtained by 
TF-shifting of a normalized function g{t) according to gk.n{t) — 
g{t — fcr)e-'^^"^*, where the grid parameters T and F have to 
satisfy TF > 1. For T < l/(2z^o) and F < 1/(2to), and hence 
TF < 1/Ah, the impulse response h{t,t') of an underspread 
fading channel satisfies [16] 

C30 oo 

h{t,t')^ E E LM{kT,nF)gk,n{t)gi^{t'). (3) 

fc=— oo ri=— oo 

The scalar channel coefficients L]i(fcT, nF) are circularly sym- 

^In the remainder of the paper, block-Toeplitz matrices with Toeplitz blocks 
will simply be referred to as block-Toeplitz matrices. 



metric complex Gaussian RVs with zero mean, variancecr^ = 
Ch(t, drdv, and correlation function 

RmikT, nF) = E{LM{{k + k')T, (n + n')F) L^{k'T, n'F)} . 

Motivated by the (approximate) diagonalization ([3]), we write 
the transmit signal x{t) as 

oo Af-1 

X{t) = E E ^k,n9k,n{i) (4) 
fc— — oc- n—0 

where the Xk^n are the information-bearing data symbols. This 
modulation scheme corresponds to pulse-shaped orthogonal 
frequency-division multiplexing (OFDM) with OFDM sym- 
bol duration T and subcarrier spacing F. The transmit-signal 
bandwidth is given by T4^ = NF. With the received signal 
y{t) = r{t) + z{t) and z{t) circularly symmetric additive white 
Gaussian noise so thagE{z(t)z*(i')} = 5{t - t'), the receiver 
computes the inner products yk.n = {y,9k,n)- Exploiting the 
orthonormality of the basis functions gk,n{t), we obtain the 
overall input-output relation 

yk,n = hk,nXk,n + Zk^n (5) 

where hk^n = Ln{kT, nF) and E{zfe,„z^, „,} = 4,fc'^n,n'- In 
summary, we transmit and receive on the channel's (approximate) 
eigenfunctions (which are TF-translates of the prototype function 
g{t)), thereby realizing a tiling of the TF-plane. For a detailed 
discussion of the approximation ([3]l and the consequences of 
restricting the class of input signals to Q, the interested reader 
is referred to [16]. In the following, we refer to the index k as 
representing the "time-domain" and the index n as representing 
the "frequency-domain". 

We assume that each channel use takes place over N subcarri- 
ers and K OFDM symbols. The A^-dimensional vector contain- 
ing the data symbols transmitted in the fcth OFDM symbol {k — 
0, 1, . . . , if - 1) is denoted as x^ = [2:^,0 a^fc,! ■ ■ • Xk^^iV ■ 
The vectors y^, h^, and are defined correspondingly. Fur- 
thermore, we define the TV-dimensional vector containing 
the data symbols transmitted in one channel use as x = 
[xq • • • x|^,_j^]^. Again, y, h and z are defined correspond- 
ingly. Finally, we denote the covariance matrix of the channel 
vector hby Ch = E{hh^}. Since /ife,„ is WSS in k and n, the 
matrix Ch is block-Toeplitz. 

The input-output relation corresponding to one channel use 
can now be written as 

y = diag{h} x + z. 

Note that the channel in (|5]l will in general not be memoryless 
because the channel gains hk.n are correlated across time index k 
and across frequency index n. Throughout the paper, we assume 
an average-power constraint according to E{||x|p} < KPT. 
In addition, setting Ppoak = PP, with /3 > 1, we impose a peak 
constraint distinguishing the following two cases: 

i) Peak constraint in time and frequency: The data symbols 

^In the remainder of the paper (apart from Section [lII-E} , we normalize the 
one-sided noise spectral density according to A^o = 1- 



satisfy 



< Ppo.i.T/N 



a.s. Vfc, n 



where a.s. stands for almost surely. 
ii) Peak constraint in time: The data symbols satisfy 



(6) 



(7) 



While Condition i) prohibits peakiness in time as well as peaki- 
ness in frequency. Condition ii) prohibits peakiness in time only, 
still allowing for a signaling scheme that is peaky in frequency. 
Note that we impose peak constraints in the channel's eigenspace 
(i.e., on the data symbols Xk.n) rather than on the continuous- 
time transmit signal x{t). 

III. Capacity Bounds under Peak Constraints in 
Time and Frequency 



For a given bandwidth W 
the channel (|5]l is defined as 



NF, the capacity (in nat/s) of 



C(W^)= lim sup/(y;x) 



(8) 



where the supremum is taken over the set of input distributions 
Vx satisfying the average-power constraint E{||x||^} < KPT 
and the peak constraint (|6|. We shall next derive two upper 
bounds and a lower bound on ([H). The first upper bound and the 
lower bound generalize the bounds for the frequency-flat, time- 
selective fading case reported in [8, Prop. 3. 1] and [17, Prop. 2.2], 
respectively, to the TF-selective underspread case. In contrast 
to [8] and [17], our proof techniques do not explicitly rely on the 
relationship between the lag-one mean-square prediction error 
of a WSS process and its spectral measure [18, Th. 4.3]. Instead, 
we use the relation between mutual information and MMSB 
discovered in [12], and a generalization of Szego's theorem on 
the asymptotic eigenvalue distribution of Toeplitz matrices to the 
case of block- Toeplitz matrices [14]. The second upper bound is 
standard and is obtained by assuming perfect CSI at the receiver. 

A. First Upper Bound 

Theorem L The capacity of a TF-selective underspread 
Rayleigh fading channel, with scattering function Ch(t, v), 
under the average-power constraint E{||x|p} < KPT and 
the peak constraint (|6| with Ppoak — PP, is upper-bounded as 

C{W) < UBi(W^), where 

W ^ TF n\ 

UBi(W^) = — log 1 + a(l¥)F— 4 - a{W)A{W, (3) 



with 



TF 



a{W) = min< 1 



W 



W 



TF \A{W,P) Pal 



(9) 



(10) 



and 



A{W, (3) = ^ jJ\og[l + ^Cm{t, I.)) drdz.. 
Proof: A sketch of the proof can be found in Appendix [l] 



The upper bound ([9]l generalizes the upper bound [11, Eq. (2)], 
which holds only for constant modulus signals, i.e., for jx^ „| = 
const., V/c, n. The bounds (|9| and [11, Eq. (2)] are both explicit 
in the channel's scattering function, have similar structure and 
coincide for (3 = 1 when a (14^) = 1 in ([TOji. 



B. Perfect Receive CSI Upper Bound 

A straightforward upper bound on ([8]) is obtained by assuming 
perfect CSI at the receiver and neglecting the peak constraint (|6]), 
but retaining the average-power constraint. The resulting bound 
is given by 



with/i-C7V(0,cr^). 



W 
TF 



Ejlog 1 



PTF 
W 



\h\'- 



(11) 



C. Lower Bound 

Theorem 2: The capacity of a TF-selective underspread 
Rayleigh fading channel, with scattering function C]i(t, z/), 
under the average-power constraint E{||x|p} < KPT and 
the peak constraint ^ with Ppeak = (3P, is lower-bounded, for 
large enough W, as C{W) > LB(VK) = maxi<.^<;3 LB{W, 7), 
where 



LB(W^,7) 



W 



I{y\x\h) 



jTF 

~ 7^//^°^(^^ w'^^^^'^'O'^^'^''' ^^^^ 
The first term on the RHS of ( [T2] i is the perfect-CSI mutual 
information of a scalar channel with input-output relation y = 
hx + z, where h - C7V(0,cr^), \x\'^ = -fPT/N a.s., and z ~ 
CAA(0,1). 

Proof: A sketch of the proof can be found in Appendix [ill 

■ 

Noting that constant modulus constellations are second-order 
optimal in the low-SNR regime (see [19, Th. 14]), we obtain the 
following explicit expression for ([T2|: 



LB(W^,7)«Pa^ 
W 

7 



2 7(P4)^rP 



w 

log(l + ^CM{T,iy)]dTdi,. (13) 



D. Discussion of the Bounds 

The upper bound (|9]l and the lower bound ( [T2| ) have the same 
structure in the sense of being the difference of the mutual 
information of a memoryless channel and a penalty term that 
depends on the fading channel memory. Moreover, both bounds 
are explicit in the channel's scattering function. Unlike the upper 
bound (|9|, the lower bound ( [T2| ) holds only for large bandwidths 
as the penalty term on the RHS of ([12]) admits a closed-form 
(integral) expression only for sufficiently large W. 

E. Numerical Evaluation of the Bounds 

In this section, we provide numerical results based on the 
upper bounds (|9| and ( [TT| ), and on the lower bound ( [T2) i, where 
the first term in ( [T2| ) is evaluated numerically for QPSK mod- 




10 10 
bandwidth (GHz) 



Fig. 1. UBi, UB2, LB, and the approximate lower bound jl3) for /3 = 1. 



ulation using the algorithm proposed in [20]. We use IEEE 
802.11a related system parameters, i.e., TF = 1.25, transmit 
power equal to 1 mW and one-sided noise spectral density 
A^o = 4.14 • 10~^^ W/Hz. The channel's scattering function is 
assumed to be brick-shaped with Ah = lO^'^ (i.e., the channel is 
highly underspread) and path loss cr^ ~ —90 dB. Fig.fllshows 
UBi(W^), UB2(iy), LB(l^) and the approximation ^13| for 
(3—1. We can see that UBi and LB take on a maximum at 
the critical bandwidth of approximately 1 GHz and then ap- 
proach zero as the bandwidth increases. The approximation ([T3| 
is accurate for bandwidths above the critical bandwidth, and 
very loose otherwise. The effect of the capacity decreasing for 
bandwidths exceeding the critical bandwidth, and eventually 
going to zero, is known in the literature as overspreading [4], 
[5], [21]. Overspreading occurs since the peak constraint (|6| 
prohibits peakiness of the signaling scheme in time as well as 
in frequency. 

IV. Infinite-Bandwidth Capacity under Peak 
Constraint in Time 

We shall next relax the peak constraint in time and frequency 
and impose a peak constraint in time only according to (|7|, while 
maintaining the average-power constraint E|||x||^} < KPT. 
In addition, we focus on the infinite-bandwidth limit only. A 
capacity lower bound, which is explicit in the channel's scatter- 
ing function, will allow us to conclude that the overspreading 
phenomenon discussed in the previous section can be eliminated 
by allowing the transmit signal to be peaky in frequency. We start 
by defining the infinite-bandwidth capacity of the channel (|5]) as 

lim sup^/(y;x) 



Co 



lim 

K — *-oo N — *-oo -p^ 



KT 



where the supremum is taken over the set of input distributions 
T'x satisfying the average-power constraint E{||x||^} < KPT 
and the peak constraint (|7]). We derive a lower bound on Coo 
by evaluating \/{KT) ■ /(y;x) in the limit K, N ^ 00 for 
a specific signaling scheme. This signaling scheme, described 
in detail in [16] mimics the FSK scheme used in [6], and can 



be interpreted as a generahzation (to channels with memory in 
time and frequency) of the on-off FSK scheme proposed in [22] 
for memoryless (Ricean) fading channels. The resulting lower 
bound is summarized as follows. 

Theorem 3: The infinite-bandwidth capacity of a TF-selective 
underspread Rayleigh fading channel, with scattering function 
C]i(r, i^), under the average-power constraint E{||x||^} < 
KP T and the peak constraint (|7]) with Ppcak = PP, is lower- 
bounded as Coo > LB(/3), where 



LB(/3) = Pct, 



log 1 + /3PChM d 



(14) 



with Ch(i^) = /j. C]s[{t, v) dr denoting the power-Doppler 
profile [1] of the channel. 

Proof: The proof of this result can be found in [16]. ■ 
The lower bound in ([T4]i coincides with Viterbi's lower 
bound ([Ti when (3 — \. The proof technique used to obtain 
Theorem 3]is, however, conceptually different from that in [6]. 
Specifically, in [6] an appropriate choice of the codebook, namely 
FSK, reduces a WSSUS TF-selective fading channel with scat- 
tering function C]i(r, v) to an effective frequency-flat, time- 
selective fading channel with power-Doppler profile C^{v). A 
Karhunen-Loeve decomposition of the resulting effective time- 
selective fading channel then leads to a closed-form expres- 
sion for the error exponent, which finally yields the capacity 
expression ([TJ. The proof of Theorem [3] on the other hand, 
starts from an eigenfunction decomposition of the WSSUS TF- 
selective channel's impulse response h{t, t'), and establishes an 
infinite-bandwidth capacity lower bound by computing the rate 
achievable for a specific signaling scheme (which mimics FSK). 
The main tool used in the proof of Theorem [5] is a property of 
the information divergence of FSK constellations, first presented 
in a different context in [15]. 

Appendix I 

Sketch of the proof of Theorem^ Fix K and N. Follow- 
ing [8], we use the chain rule for mutual information to write 
the supremum in dSll as 



sup /(y; x) sup{/(y; x, h) - /(y; h|x)} 

< sup < sup /(y ; x, h) - inf /(y ; h|x) 

0<a<l I -p^ 



(15) 



where ([TSj follows by rewriting the supremum over Vx as a 
double supremum over a G [0,1] and over the set of input 
distributions T-'x satisfying E{||xp} = aKPT and the peak 
constraint (|6|. Define X = diag{x}. The first term on the RHS 
of ( [TSl l can be upper-bounded as follows 

(a) 

sup/(y;x,h) < sup logdet(l + EjXChX^}) 

E{||xP}=Q_fs-pT 

K-1 N-1 



< 



E{||x|P}=aKPT 

/ aPT 
< KNlogi 1 



N 



(16) 



where (a) follows by ignoring the peak constraint and upper- 
bounding /(y; X, h) by the capacity of an AWGN channel with 
input Xh, (b) follows from Hadamard's inequality, and (c) 
follows from the concavity of the log function. The second term 
on the RHS of ( [T5] l can be lower-bounded as 

■mil{y-h.\^)>c jJ\og(^ + ^^Cn{r,v)^dTdu (17) 

with c = aKNTF/p. The proof of detailed in [16], is 
based on the relation between mutual information and MMSE 
discovered in [12] and on the closed-form expression for the 
noncausal MMSE of a two-dimensional stationary random pro- 
cess obtained in [23]. Inserting ( fTS] ) and ( [T7] l in ([TSj, dividing 
by KT, and using W = NF, we get 



UBi(W^)= sup J T^logfl 

0<a<l M -f^ \ 



yPTF 



aW 



log 1 



W 



Cmir, v) drdv 



As the function the supremum is taken over is concave in a, the 
maximizing value given in ( fTO] ) is unique and can be obtained 
by differentiating with respect to a. 

Appendix II 

Sketch of the proof of Theorem^ For the sake of simplicity 
of exposition, we outline the proof for (3 — \ only. The result 
for general (3 follows from a simple time-sharing argument [17, 
Corollary 2.1]. We first lower-bound C{W) in (|8]l by assuming a 
specific input distribution, namely, by letting x have independent 
and identically distributed (i.i.d.), constant modulus entries, each 
of which satisfies |xp = PT/N . In the following, vectors x 
with this property will be denoted as y^uA.- Next, we use the 
well-known inequality 

-^(y; Xi.i.d.) = /(y; xy.d., h) - /(y; hlxu.d.) 

>/(y;x,,d.|h)-/(y;h|x,,d.). (18) 
The i.i.d. and the constant modulus assumptions imply that 

/(y;x,,d.|h) = X7V/(y;a:,,d.|/i) (19) 

and 



Ht, h|x. 



i.dj 



log det I 



N 



(20) 



Next, by inserting ([T9| and (|20| in ( fTS) , dividing by KT, and 
taking the limit K ^ oo, we obtain 

N 

C{NF) > LB{NF) = -I{y;xuA\h) 

Finally, we use the generalization of Szego's theorem to block- 
Toeplitz matrices provided in [14] to show that for large 
enough N the second term on the RHS of ( |2T| admits the 



following closed-form approximate integral expression 
1 / PT 

« NfJ^ ^ log (^1 + -^Cm{t, jy)^ drdi.. (22) 

The proof is complete by replacing ( [22] ) in ( |2T] i and substituting 

W = NF. 
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